Scheduling computations with provably low synchronization overheads

نویسندگان

چکیده

We present a Work Stealing scheduling algorithm that provably avoids most synchronization overheads by keeping processors’ deques entirely private default and only exposing work when requested thieves. This is the first paper obtains bounds on are (essentially) independent of total amount work, thus corresponding to great improvement, in both design theory, over state-of-the-art algorithms. Consider any computation with $$T_{1}$$ critical-path length $$T_{\infty }$$ executed P processors using our scheduler. Our analysis shows expected execution time $$O\left( \frac{T_{1}}{P} + T_{\infty }\right) $$ , incurred during at \left( C_{\mathrm{CAS}} C_{\mathrm{MFence}}\right) PT_{\infty where $$C_{\mathrm{CAS}}$$ $$C_{\mathrm{MFence}}$$ respectively, denote maximum cost executing Compare-And-Swap instruction Memory Fence instruction.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Measuring Synchronisation and Scheduling Overheads in OpenMP

| Overheads due to synchronisation and loop scheduling are an important factor in determining the performance of shared memory parallel programs. We present set of benchmarks to measure these classes of overhead for language constructs in OpenMP. Results are presented for three diierent hardware platforms, each with its own implementation of OpenMP. Signiicant diierences are observed, which sug...

متن کامل

Simulation of Real-time Multiprocessor Scheduling with Overheads

Numerous scheduling algorithms were and still are designed in order to handle multiprocessor architectures, raising new issues due to the complexity of such architectures. Moreover, evaluating them is difficult without a real and complex implementation. Thus, this paper presents a tool that intends to facilitate the study of schedulers by providing an easy way of prototyping. Compared to the ot...

متن کامل

Reducing Scheduling Overheads for Concurrent Logic Programs

Strictness analysis is crucial for the efficient implementation of the lazy flmctionM languages. A related technique for the concurrent logic languages (CLLs) called schedule analysis is presented which divides at compile-time a CLL program into threads of totally ordered atoms, whose relative ordering is determined at run-time. The technique enables the enqueuing and dequeuing of processes to ...

متن کامل

FIFO with Offsets: High Schedulability with Low Overheads

The OS scheduler’s memory and runtime overheads form crucial design constraints for embedded systems implemented on low-cost hardware platforms. Table-driven scheduling can provide a high level of schedulability; however, it also consumes significant amounts of memory. By contrast, effective non-preemptive scheduling policies, such as the non-workconserving Critical-Window EDF (CW-EDF), have lo...

متن کامل

Scheduling Imprecise Computations with Wind-up Parts

The imprecise computation model provides the ability to cope with unpredictable workloads. However, there is no consistent way on how to terminate the computation in its early stage. This paper describes a novel approach for safely terminating imprecise computations. First, a new logical part called wind-up part is added to the imprecise computation model. This wind-up part is used by applicati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Scheduling

سال: 2021

ISSN: ['1099-1425', '1094-6136']

DOI: https://doi.org/10.1007/s10951-021-00706-6